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ABSTRACT 

Covalent binding is an important mechanism for 
many drugs to gain its function. We developed a 
computational algorithm to model this chemical 
event and extended it to a web server, the 
CovalentDock Cloud, to make it accessible directly 
online without any local installation and configur- 
ation. It provides a simple yet user-friendly web 
interface to perform covalent docking experiments 
and analysis online. The web server accepts the 
structures of both the ligand and the receptor 
uploaded by the user or retrieved from online data- 
bases with valid access id. It identifies the potential 
covalent binding patterns, carries out the covalent 
docking experiments and provides visualization of 
the result for user analysis. This web server is free 
and open to all users at http://docking.sce.ntu.edu. 
sg/. 

INTRODUCTION 

The prediction of the interaction between the proteins and 
small molecules, often through molecular docking, serves 
as an important step in rational drug discovery pipeline 
(1,2). Despite the fact that these interactions can be either 
non-covalent or covalent, few of the current molecular 
docking programs can account the formation of covalent 
bond during docking properly (3-6), if not incapable of 
doing it at all. However, covalent binding is important and 
common. It is reported that 3 of the 10 top-selling drugs in 
USA in 2009 are covalent drugs (7). 

To formulate and model covalent docking more accur- 
ately and to enable large-scale covalent virtual screening, 
we developed the software package CovalentDock (8). 
The package has an empirically calibrated energy model 
to estimate the contribution of the covalent binding. It can 
also automatically detect and process structural patterns 
to determine which covalent linkage is most probable. 



Although best effort for user-friendliness has been 
observed for the CovalentDock package, many users still 
experienced difficulties in compiling, installing or using the 
software owing to the diverse variation of their machines, 
especially for those who lack in such technical expertise. 
Moreover, owing to the fact that there are multiple 
options for the molecular structures to consider owing 
to chirality when forming a covalent linkage, the perform- 
ance of running CovalentDock on an average workstation 
is not desirable. 

With such limitation and desire to make the computa- 
tional tool available to more users, we present a fully func- 
tional web server, the CovalentDock Cloud, for 
automated covalent docking. It allows the users to carry 
out covalent docking experiments with their own data and 
analyze the results via a web interface, without any con- 
figuration effort on their local machines. This web server is 
publicly available at http://docking.sce.ntu.edu.sg/. It is 
free and open to all users, and no login is required. 

MATERIALS AND METHODS 

Covalent docking with empirical energy estimation of 
covalent linkage formation 

The CovalentDock Cloud web server uses the 
CovalentDock in its backend. CovalentDock (8) is a 
software package inherently considering covalent 
docking, which is capable of the automatic identification 
of the chemical patterns suitable for covalent linkage for- 
mation, pre-processing the structures by altering them to 
reflect the nature of covalent binding, taking into account 
of the chirality during structure change and molecular 
geometry constrains. 

One unique contribution we developed for 
CovalentDock is the energy estimation for the formation 
of the covalent linkage. The interaction between the ligand 
and its receptor is still modeled the same as in conven- 
tional molecular dockings through non-covalent inter- 
actions such as van der Waals, electrostatics, solvent 
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effect or so. The additional energy contribution from 
covalent linkage formation is estimated by a newly 
formulated model: 



£)(g-2a(r-/'o) _ 2e~ a( - r ~ r ^) 

0, 



TAS+C, 



r < r m 
r > r m 



where D is the dissociation energy, a is a parameter 
controlling the well width, r is the bond length, and r 0 is 
ideal equilibrium bond length, T is the system tempera- 
ture, AS is the conformation entropy estimated by 
quantum mechanics simulation, C is an empirical correc- 
tion constant to make it compatible with the energy 



contribution from non-covalent interaction and r m is the 
maximum bond length without disassociation. The par- 
ameterization of this equation is subjected to the specified 
type of covalent bonding pairs. 

This energy model, together with the automation of pre- 
processing, made the fully automated covalent docking 
and covalent virtual screening possible. Readers can 
refer to the CovalentDock article (8) for the description 
on the technical details and how it works. 

Input 

CovalentDock Cloud has a simple and easy-to-use inter- 
face for new job submission with user-specified data. 



Job token: 2222222222222222222222222222222222222222 
Please make a note of this token or bookmark this page for future access. 
Status: Finished! 



You may need to install Java to use Jmol for visualization. Refer to this Mouse Manual from Jmol wiki for a quick start. 




Select for Chirality 

ligand_bond_2_C1 1_1 _mini |y_| 
Cluster Estimated AG 
Rank (kcal/mol) 



-13.16 
-10.83 
-9.68 
-6.26 
-5.88 



View control 

Color: Highlight ~~\*\ 
Style: Highlight L*j 

EShow surface 
□ Spin 



c Receptors are represented in grey trace with semi-transparent surfaces. The binding pocket is shown as blue st 
' Potential covalent linking residues within the binding pocket are highlighted and labeled as ['Residue Type']'Se 
c Ligands are represented^ in white sticks with covalent linkable atoms colored red. 
1 The covalent linkages fif there is any formed during docking) are also in red sticks. 
1 The bond length of the covalent linkages are also shown for your reference. 



jence number' : 'Chain identifier' 



Download the result for further investigation. 

Creation time: April 3, 2013, 3:54 p.m. 
Start time: April 3, 2013, 3:55 p.m. 
Finish time: April 3, 2013, 4:09 p.m. 

Figure 1. The screenshot of a typical result page from CovalentDock Cloud. The structure of the covalent docking result is visualized with Jmol 
plugin. The receptor structure is presented as gray backbone trace with the binding pocket highlighted in blue sticks, whereas the ligand is in white. 
All potential covalent linking residues in the binding pocket are specially colored by their residue type. The covalent linkage formed during docking 
between the ligand, and the receptor is emphasized in red. The length of this covalent bond is also marked on the canvas for reference. A simple 
control panel is attached on the right to allow the users to choose which configuration of the ligand to display and to show the rank and estimated 
free energy of each result cluster, as well as some options to alter the visualization for better inspection. The results are also available for down- 
loading in case further investigation is necessary. Some other information about the covalent docking job such as the status and timing information is 
also available on this page. The all-two token in the screenshot is the result of deliberate manipulation to make it a special live sample. Real tokens 
are much more unpredictable and secure from competent forgery. 
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When creating a new covalent docking job, the user needs 
to specify the structures of both the receptor and the 
ligand. For receptors, user can submit them in as PDB 
files, while MOL2 files are accepted for ligands. The web 
site also provides an option for the users to retrieve struc- 
tures from online databases. With valid access IDs, 
receptor structures can be fetched from RCSB PDB (9), 
and ligand structures can be retrieved from ZINC (10). As 
the underlying covalent docking protocol is not a blind- 
docking protocol, knowledge about the binding site is also 
required. The user only needs to specify the position of the 
binding site by the target residue identifier or more directly 
by the Cartesian coordinates of its center, and the size of 
the binding site will be automatically determined accord- 
ing to the ligand size. 

The web server searches the uploaded structures for 
supported chemical patterns before proceeding to 
perform covalent docking. This mainly involves the iden- 
tification of electrophile-nucleophile pairs with one in 
the ligand and the other in the receptor. Currently, the 
backend CovalentDock features the mechanism of the 
Michael additions and of the ring opening of (3-lactams, 
both of which are prominent representatives of covalent 
binding. More chemical patterns will be available in near 
future as the CovalentDock package evolves. 

Output 

On successful submission of a new job, a unique token will 
be given. The token is generated as a salted 160-bit SHA-1 
digest, which is proven to be secure from competent 
forgery. It is also the algorithm standardized and recom- 
mended by US government for collision resistance 
applications (11). The covalent docking job will be access- 
ible if and only if a valid token is presented, preserving the 
confidentiality of the user data submitted to the server. All 
data submitted to the server will be kept on the server for a 
limited period (15 days for successful finished jobs and 
24 h for jobs terminated due to errors) and destroyed 
afterward. 

The backend CovalentDock will perform 10 independ- 
ent covalent docking experiments for each job. The results 
will be clustered by their structural similarity. On success- 
ful job finishing, the results will be displayed in a Jmol 
plugin, with a visualization emphasis on the newly 
formed covalent linkage, if there is any. A control panel 
is also presented alongside the visualization window to 
allow the user the choice on which of the configuration 
results to be displayed. The results are also available for 
download, in case further investigation is necessary. The 
screenshot of a typical covalent docking result page is 
shown in Figure 1. 

Web server implementation 

The backend CovalentDock package is implemented in 
C++ and expanded from the source code of 
Autodock4.2 (3), accompanied with Python and bash 
scripts for automation. The web site is implemented with 
the Django web framework and MySQL database. The 
web server will perform input validation on job creation 
and send the data to the Nanyang Analytics HPC Cluster 



in Nanyang Technological University for actual docking 
experiments. Only the web server is open to the Internet, 
and the cluster is secured by the enterprise firewall for 
maximum security of the user data. 



RESULTS 

The backend CovalentDock package has been validated 
on a data set of 76 complexes discussed in literature, all of 
which are experimentally observed with covalent linkage 
formed between the ligand and the receptor. Among the 
76 complexes, 13 are with Michael acceptors, and the rest 
are in P-lactam family. 

The benchmark results suggest that CovalentDock is 
highly capable of producing accurate covalent docking 
results. It achieved an average root-mean-square deviation 
(RMSD) of 1.68 A when comparing the predicted and the 
native ligand structure, significantly outperforming 
the average RMSD of 2.49 A and 3.69 A achieved by the 
covalent docking methods from Autodock (3) and 
GOLD (4), respectively (8). CovalentDock also achieved 
a better hit rate (the percentage of results with an RMSD 
within a given threshold) when compared with the other 
two covalent docking protocols, as shown in Figure 2 
where the hit rate is calculated with different thresholds 
(8). 

In addition, detailed analysis also suggests that 
CovalentDock has a strong chirality and target selectivity 
during covalent docking, and it preserves the molecular 
geometry of the structures very well. Readers can refer 
to the CovalentDock article (8) for detailed description 
on validation and case studies. 



CONCLUSION 

The prediction and quantification of covalent linkage 
formation in molecular docking is of great interest and 




Cutoff RMSD 



Figure 2. The hit rate of CovalentDock, Autodock and GOLD on the 
data set of 76 covalently bound complexes under different threshold 
RMSD. The hit rate under a given threshold RMSD is calculated as 
the percentage of the results with an RMSD smaller than the threshold. 
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potential to the systematic discovery of covalent 
drugs. The CovalentDock Cloud provides a simple yet 
user-friendly portal to carry out covalent docking experi- 
ments and to analyze the result online via a web browser. 
With the powerful backend docking engine, we believe 
that this web server will offer a more accessible way for 
simulation and prediction of more accurate covalent 
docking. 

In the future, it is planned to fine-tune the web site for 
better user-experience and to continuously develop and 
enhance the CovalentDock package. It is also planned 
to make more reactions adopting covalent binding mech- 
anism available and allow users to specify the covalent 
linking pattern by themselves. 
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